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(57) Abstract 

A computer system for converting a text message into an 
audio representation, tnrhiHfatg a text to speech co nvert er and a 
text reader control procedure. A classifier procedure, called by the 
control procedure, classifies each word in the text message with 
regard to syntax, punctuation and membership in predefined classes 
of words, the predefined classes of words including numbers, 
measurement units, geographic designations, and date/time values. 
The classifier procedure generates classification values for each 
of the words In the text message* An abbreviation expanding 
procedure, called by the control procedure after execution of 
the classifier procedure, identifies abbreviation words in the text 
message mat match predefined abbreviations and identifies a set of 
potential expansion values for each identified abbreviation word. 
The abbreviation expanding procedure evaluates, based on the 
classification values for words neighboring the identified words, 
which, if any, of die potential expansion values is applicable, 
and substitutes the potential expansion value for the identified 
abbreviation word when the evaluation yields a success value. An 
Bxrcmym/initialism expanding procedure identifies words in the text 
message that are acronyms and lnitialisms, parses pronounceable 
syllables within the identified words and generates a substitute 
string in the form of a sequence of units. Each unit in the generated 
sequence is a letter, number, pronounceable syllable or a multiple 
letter identifier. The control procedu r e passes to the text to speech 
converter for each word in the text message (A) the expansion 
value, if any, generated by the abbreviation expanding procedure, 
(B) the substitute string, if any, generated by the acrccym/initialism 
expanding procedure, and otherwise (Q the word. 
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TEXT TO SPEECH READER 
FOR ELECTRONIC MAIL MESSAGES. ADDRESS BOOKS AND THE LIKE 

The present invention relates generally to text to speech converters, and 
particularly to an extension of text to speech technology for improving the 
handling of text that includes abbreviations, acronyms, infflalisms, 
non-standard compound words, numbers, dates, telephone numbers, 
5 fractions, times, and the like. 



BACKGROUND OF THE INVENTION 



10 The use of text to speech (TTS) converters is well known. TTS converters 
have been used to Improve access to computer stored information by 
visually impaired persons, and for uses such as in "interactive voice 
response" (IVR) systems in which a remotely located user accesses digitally 
stored information in a database via a telephone. 



15 



The present invention is directed to computer applications in which voice 
access to information stored in a computer is desirable, and more 
particularly to systems that provide voice access to information that by its 
very nature tends to include a wide variety of formats and individual writing 
20 styles, including conventional and nonstandard abbreviations, acronyms, 
initialisms, numbers, dates, times and telephone numbers in many formats, 
fractions, inappropriate spacing, emotion indicators (e.g., smiley faces, 
asterisks and underlines), and the like. 
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Present t xt to speech converters can produce intelligible speech only from 
text which conforms perfectly to the spelling and grammatical conventions of 
a language. Even the highest quality text to speech converters cannot read 
typical electronic mail (e-mail) messages intelligibly. Unlike carefully edited 
5 text, e^nall messages frequently contain sloppy, misspelled text with random 
use of case, spacing, fonts, punctuation, emotion indicators and a 
preponderance of industiy-specific abbreviations and acronyms. In order for 
text to speech conversion to be useful for such applications, it must 
implement flexible, sophisticated rules for intelligent interpretation of even 
10 the most ill-formed text messages. 

The difference between an acronym and an initialism is as follows. An 
acronym is formed from pronounceable syllables, even it is represents a 
"made up word," while an initialism is not pronounceable except as a 
15 sequence of letters and numbers. Thus, "IBM" and "YMCA" are initialisms, 
while "NASA" and "UNICEF" are acronyms. Some words, such as 
"MSDOS" and "PCNET" are a mix of acronym and initialism components. 

An application that provides telephone access to the user's electronic mail 
20 (e-mail) messages using a standard TTS converter will provide inadequate 
service to end users because a standard TTS converter (A) will attempt to 
read portions of the e-mail message that the end user does not want to 
hear, such as the e-mail address of the sender and recipient and the trail of 
network nodes through which the message was transmitted, and (B) will not 
25 property handle many of abbreviations, acronyms and initialisms typically 
found in such messages. Moreover, when something is mispronounced, it 
confuses the listener and makes it difficult to understand portions of the 
message that are pronounced correctly. 

30 Other voice access applications which are inadequately handled by standard 
TTS converters include applications for telephone access t a user's 
personal telephone and address directory or an organization's telephon and 
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address direct ry, applications for verbal access to information in a 
8preadshe t, proofreading, and access to computer stored documents by a 
blind person, and applications which provide audio feedback in addition to 
visual display. 

5 

It is a fact of life that written text In the English language, as well as written 
text in most other languages, includes a rather large number of 
abbreviations, measurement values, times and dates and other symbols that 
the average well educated person knows how to read, but which do not 
10 follow the phonetic pronunciation rules applicable to most standard words in 
the applicable language. Furthermore, many common abbreviations are 
ambiguous until the context of the abbreviation is determined. For instance, 
the string "SF might mean "San Francisco,' "Sioux Falls," or "Santa Fe." 
Another example is that the text: 

15 

the 20 ft wayne jumped at the ft wayne Indiana meet. 

is 1ft worth buying? is ft worth ahead? dallas 27, ft worth 13 :-) 

i believe dr Jones lives on oak dr. 

I live at 6e maple st and my mother lives at 6 e st. 

20 

is correctly spoken by the present invention as if it were written: 

The 20 feet Wayne jumped at the Fort Wayne Indiana meet 
Is 1 foot worth buying? is Fort Worth ahead? Dallas 27, Forth 
25 Worth 13, ha ha. 

I believe Doctor Jones lives on Oak Drive. 

I live at 6 East Maple Street and my mother lives at 6 E Street. 
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Identical symbols can have different meanings and pronunciations in 
diff rent contexts. For instance, quotation marks can be used as the inch 
unit identifier, and for seconds of time and seconds of angular measurement: 



Another large category of troublesome text concerns the differences 
between acronyms and initlalisms. For instance, the term "NASA" is an 
acronym that is pronounced like a word, while the term "IBM" is an initialism 
that is pronounced as a sequence of three letters. No product, or literature, 
known to the inventors discloses a systematic method of distinguishing 
between acronyms and initialisms and how that should be accomplished for 
purposes of text to speech conversion. 

In summary, the number of examples of text that are problematic for 
standard TTS converters is large. 

It is a primary object of the present invention to provide a system and 
method for systematically pre-processing text that would not be properly 
converted into spoken words by a conventional TTS converter so as to 
produce substituted text that, when processed by a conventional TTS 
converter, will represent the words that would be spoken by a human reader 
of the same text. 



IflxJ 

'Four score and " 



Description 
Quoted text 

measurement in inch units 
angular or directional measurement 



3.5" 



27 , 35" 



SUMMARY OF THE INVENTION 



30 



In summary, the present invention is a computer system for converting a text 
message into an audio representation, including a text to speech converter 
and a text reader control procedure. 
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A classifier procedure, called by the control procedure, classifies each word 
In the text message with regard t syntax, punctuation and membership in 
predefined classes of words, the predefined classes of words including 
numbers, measurement units, geographic designations, and date/time 
5 values. The classifier procedure generates classification values for each of 
the words in the text message. 

An abbreviation expanding procedure, called by the control procedure after 
execution of the classifier procedure, identifies abbreviation words in the text 

10 message that match predefined abbreviations and identifies a set of potential 
expansion values for each identified abbreviation word. The abbreviation 
expanding procedure evaluates, based on the classification values for words 
neighboring the identified words, which, if any, of the potential expansion 
values is applicable, and substitutes the potential expansion value for the 

15 identified abbreviation word when the evaluation yields a success value. 

An acronym/inWalism expanding procedure identifies words in the text 
message that are acronyms and inhialisms, parses pronounceable syllables 
within the identified words and generates a substitute string comprising a 
20 sequence of units. Each unit in the generated sequence consists of e letter, 
number, pronounceable syllable or a multiple letter identifier. This procedure 
can also be applied to expanding e-mail user names such as bjmaslin and 
cfwong to B J Maslin and C F Wong. 

25 The control procedure passes to the text to speech converter for each word 
in the text message (A) the expansion value, if any, generated by the 
abbreviation expanding procedure, (B) the substitute string, if any, generated 
by the acronym/inWalism expanding procedure, and otherwise (C) the word. 



30 
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BRIEF DESCRIPTION OF THE DRAWINGS 



Additional objects and features of the invention will be more readily apparent 
from the following detailed description and appended claims when taken in 
5 conjunction with the drawings, in which: 

Figure 1 is a block diagram of a computer incorporating an enhanced text 
reader in accordance with the present invention. 

10 Figure 2 is a conceptual data and control flow diagram representing the flow 
of data between software modules in a preferred embodiment of the present 
invention. 

Figure 3 is a flow chart of the enhanced text expansion and conversion 
15 process in a preferred embodiment of the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

20 Referring to the Figures, a computer system 100 incorporating a preferred 
embodiment of the present invention can be any conventional computer as 
shown in Figure 1, or can be a distributed computer system having one or 
more servers and multiple workstations. For purposes of explaining the 
invention, it will be assumed that the invention is incorporated in a single 

25 workstation. 

The computer system 100 includes a central processing unit 102, primary 
memory 104 and secondary computer memory 106 subsystems, and a user 
interface 108. The user interface 108 typically includes a display 110, a 
30 microphone 1 12, an audio speaker 1 14, a pointing device 116 such as a 
mouse r trackball, and a keyboard 118 for ent ring text. 
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The computer memory 106 stores a number of different programs, 
sometimes herein called procedures, and data structures. These programs 
and data structures, which are discussed in more detail below, Include the 
computer's operating system 120, application programs 122, one or more 
5 text files 124, an e-mail or other text database (e.g., of messages in the 
user's e-mail "mailbox") 126, and a text reader module 128. 

The text reader module 128 includes a conventional text to speech converter 
130, such as the Prose 4000 made by Centigram Communications Corp., a 

10 text reader control procedure 132, a header/body message analyzer 

procedure 134. a text classifier procedure 136, classifier output tables 138, a 
text expander procedure 140, a number expander procedure 142, a table of 
frequently used unambiguous words 144, an abbreviations translation table 
146, a city-state table 147, a set of abbreviation expansion procedures 

15 (sometimes called rules) 148, an acronym and inftialism expander procedure 
150, and a morph table 152. 

The computer system 100 also will typically include a communications 
interface 154, such as a modem or network interface, for telephone or other 
20 electronic communication with a remotely located computer or telephone 
156. 

Operation of the preferred embodiment will be explained by following the 
processing of a message by the enhanced text to speech reader 128. In 

25 particular, we will assume that a user of system 100 at a remote location 
calls into computer system 100 via communications interface 154. 
Application 122 receives the call and commands from the user, entered 
either using DTMF tones from a telephone keypad or verbally (if the 
application includes a voice to text converter recognizer). When the 

30 invention is used by a local user at the site of the computer 100, commands 
will more typically be entered using a mouse pointer device 116 and/or 
keyboard 118. Furthermore, the user requests that the application 122 
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"read" aloud an e-mail massage having a complex header and sev ral 
paragraphs of text in the body of the e-mail message. 

The application program calls (step 200, Figure 3) the text reader module 
128, and in particular, makes a procedure call to the text reader control 
procedure 132 specifying a text to be read aloud and the type of the 
message. In this case the message type is 'email*. The other message 
types supported in the preferred embodiment are 'address book,' 
■spreadsheet" and 'regular*, although the invention can be used with other 
types of messages as well. The primary significance of the message type is 
that for e-mail messages, the message header is parsed to remove 
information irrelevant to most end users. Other message types are 
supported by other special processing, e.g., If the message type is 
•spreadsheet,* the string *5/7" would be expanded to five sevenths* 
whereas for other message types *5/7* might be expanded to "May seventh." 

Additional message types are expected to be implemented in future 
embodiments of the invention, in particular, domain-specific or 
application-specific embodiments with specialized abbreviation tables are 
expected. For instance, an e-mail reader for a law firm or law department 
would include an abbreviation translation table having abbreviations 
frequently used in that profession, while an e-mail reader for a bio-tech 
research company could be customized to include an abbreviation 
translation table with abbreviations frequently used by biologists, chemists 
and genetics researchers. 

Next, if the message type is "email", which it is in our example, the text 
reader control procedure 132 calls a procedure called the header/body 
analyzer 134 (step 202). The head/body analyzer is, in essence, a filter that 
filters out all portions of the e-mail message header except (A) the name 
following the "From" label in the header, but excluding the e-mail address 
after that name, (B) the date following the "Date' label in the header, and 
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(C) the text following the "Subject" label in the header. The date In th 
"date" line Is processed by the number expander procedure 142 ( .g., 
expanded from "27/05/94" to "May twenty-seventh, 1994") and then passed 
to the text to speech converter 130 for conversion to speech. The subject 
5 header line is processed in the same manner as all other portions of the text 
message, as will be described next. 

The text reader controller procedure processes text, other than an e-mail 
message header, one paragraph at time. Each paragraph is passed, in 
10 sequence, to the text classifier procedure (204) and then to the text 

expander procedure (206). The text In each message paragraph is thus 
processed in two passes. In the first pass, every word in the paragraph is 
analyzed and classified. 

15 Text Classifier - First Pass Processing of Text 

The text classifier procedure 136 generates eleven arrays 138 of 
classification values, each of which stores a single element for each "word- 
In the paragraph being processed. A word, for purposes of the enhanced 
20 text reader 128 is any chunk of text that requires separate processing. For 
instance, in strings having special punctuation such as slashes, hyphens, 
and colons, each text element separated by such punctuation is treated as a 
separate word. Thus the string "3/12/95" is treated as three words: 1) "3T, 
2) "12/" and 3) "95". 

25 

As indicated above, each word in the text being processed is assigned 
eleven classification values by the text classifier procedure 136 and those 
eleven values are stored in eleven classification arrays 138. The definition 
of the classification values stored in each classification array 138 is listed In 
30 Table 1 . The function of the Text Classifier procedure 1 36 is to compute 
values often needed by the Text Expander procedure 140 by classifying 
each word in the text message with regard to syntax, punctuation and 
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membership in several predefined classes of words, the predefined classes 
of words including numbers, measurem nt units, geographic designations, 
and date/time values. More specifically, the Text Classifier procedure 136 
determines and classifies the syntax aspects of each word with respect to 
5 capitalization and punctuation and also makes initial determinations as to 
whether each word may be a member of several special classes of words, 
such as numbers, dates, times, geographic names, measurement units, 
measurement modifiers and the like which may require special processing 
by the Text Expander procedure 140. 

10 

The provision of the Text Classifier 136 simplifies the Text Expander 
procedure 140 since many of the word type and syntax determinations 
needed for text expansion decisions are pre-computed by the Text Classifier 
procedure 136. 

1 5 

The following are descriptions of the values stored in each of the 
classification arrays 138 by the Text Classifier 136. 

The FPER ("following period") array stores a yes/no indication for each text 
20 word, where "yes" indicates that the word ends with a period. Examples of 
words for which the FPER value is "yes" are abbreviations ending in a 
period and the last word of a sentence. 

The ILC ("Initial Lower Case") array stores a yes/no indication for each text 
25 word, where "yes" indicates that the word starts with a lower-case letter and 
the rest of the letters of the word are lower case. 

The IUC ("Initial Upper Case") array stores a yes/no indication for each text 
word, where "yes" indicates that the word starts with an upper-case letter 
30 and the rest of the letters are lower case. Examples of words for which the 
IUC value is "yes" are "San," "California," and the first wort of a sentence. 
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The AUC ("All Upper Case") array stores a yes/no indication for ach t xt 
word, where "yes" indicates that all letters in the word are upper-case. 
Examples of words for which the AUC value is "yes': CA, NASA. 

5 The PSNUM ("Pseudo-number") array stores a yes/no indication for each 
text word, where "yes" indicates that the word is a pseudo-number, which is 
any word that can sometimes be used in place of a number. Examples of 
pseudo-numbers, which are assigned a PSNUM value of "yes" are "several," 
"few," and "doz." For example, when processing the "ft." abbreviation for 
10 feet", "several ft.", "a doz. ft." and "5 ft." are all processed similarly. For 
purposes of generating the values for the PSNUM array, the classifier 
includes an internal table of pseudo-numbers and their associated standard 
abbreviations. 

1 5 The STPROV ("State or Province") array stores a yes/no indication for each 
text word, where "yes" indicates that the word is an abbreviation or full word 
or part of an abbreviation or full word that is potentially a U.S. state name or 
a Canadian province name. Examples of words which are assigned a 
STPROV value of "yes" are Ca, CA, California, Missouri, and British 

20 Columbia. Note that the last example is actually two words, both of which 
will be assigned a STPROV value of "yes". For purposes of identifying 
abbreviations and words that represent states and provinces, the classifier 
looks up abbreviations in the abbreviations translation table 146 and full text 
words in the city-state table 147. All abbreviations for state and province 

25 names are indicated in the abbreviations translation table 146 by a "rule 
type" of "state." When an abbreviation is found in the table 146 to have a 
rale type of "state" name that word is assigned an STPROV value of "yes." 
When a full text word (or set of adjacent words) is found in the state field of 
a record in the city-state table 147, then that word is assigned an STPROV 

30 value of "yes". 
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The PREPUNC ('Preceding Punctuation*) array stores a representation of all 
punctuation located at the beginning of a word, such as the left parenthes s 
at the beginning of a telephone number's area code. 



5 The PUNC ("Punctuation") array stores a representation of all punctuation 
located at the end of a word, such as a period at the end of an abbreviation. 
Examples of PUNC array values are: 
PUNC value Text 
in; 

10 ., e.g., 



The SEPR ("Separator") array stores a representation of the separator. If 
any, between the current word and the next, whether or not there are 
surrounding spaces. Examples: 
15 SEPR value Text 

+ 3+4 

1-16-95 

/ 1/16/95 and 3/4. 

20 The NUM (number") array stores a yes/no indication for each text word, 

where "yes" indicates that the word represents a number. In a number each 
character is an integer, period, or comma, possibly followed by a modifier 
like a plus sign (e.g., "20+ rules") or the letter K (e.g., "256K memory"). 

25 Finally, the QUAL array stores a character string that describes the potential 
type of the word. The different Qual values and what they represent are 
listed in Table 1. It should be noted that the values stored in the QUAL 
array represent the potential type of a word. The QUAL values are 
determined without consideration of the context in which the word is found. 

30 It is the Text Expander procedure 140 that considers the context of each 
word before determining what expansion of the word is required, if any. 
Furthermore, it is noted that when the classifier procedure 136 finds a word 
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in th abbreviations translation table 146, the rule type listed in the first 
matching record in the abbreviations translation table 146 is used as the 
QUAL value for that word. 

5 Text Expander • Second Pass Processing of Text 

After the Text Classification procedure 136 completes processing of a 
paragraph, the Text Reader Control procedure 132 calls the Text Expander 
procedure 140, which receives the initial text in the paragraph as well as the 
10 classification arrays 136 generated by the Text Classifier procedure 136 
(step 206). 

For each word in the text paragraph, the Text Expander procedure 140 first 
determines whether the word is (A) one of a set of frequently used 

15 unambiguous words found in table 144 (a preferred version of which is listed 
in Table 2) or (B) a number (208. 210). If the word is one of a set of 
frequently used unambiguous words found in table 144, it is passed directly 
to the TTS converter 130 via the control procedure 132. If the word is a 
number, the Number Expander procedure 142 is called (208) by the Text 

20 Expander procedure 140. 

The Number Expander procedure 142 processes not only the next word in 
the paragraph to be processed, but all the succeeding words that are part of 
the same logical quantity. Thus, a date or phone number, which is 
25 processed as a sequence of several words by the Text Classifier procedure 
136, will be processed with a single call to the Number Expander 142. 

The Number Expander procedure 142 evaluates punctuation and separators, 
as indicated in the classification arrays 136, as well as neighboring words 
30 before and after the number string being processed, to determine what type 
of number is represented by the received text, such as a telephone number, 
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soclal security number, time, date, fraction, monetary valu , or standard 
numeric value. 

For instance, the number string "94306" is likely to be a zip code if ft closely 
5 follows a city and/or state name, which is easily determined by looking at the 
STPROV and QUAL values for the preceding words, or by the words "zip 
code," in which case it should be spoken as a sequence of digits: "nine four 
three zero six". On the other hand, if the string "94306" is preceded by a 
dollar sign, then it should be spoken as a five digit dollar value: "ninety-four 
10 thousand, three hundred six dollars". The Number Expander procedure 

expands each number value to a full word string based on the determination 
of the type of number represented by the text, and passes the result to the 
TTS converter. 

15 Any word not processed by the Number Expander and not found in the table 
of frequently occurring unambiguous words is next processed by looking it 
up in the abbreviations translation table 146 (step 210). Each record in the 
abbreviations translation table 146 has the following three fields: 

20 keyword, expansion value, rule type 

where "keyword 11 is an abbreviation value, "expansion value" is a potential 
full word expansion of the abbreviation value, and "rule type" is a reference 
to which of the abbreviation expansion procedures 148 is to be executed 
25 (212) to determine whether or not the expansion value should be used. 

For instance, the abbreviation "sq" might have three entries in the 
abbreviations translation table 146: 

30 sq, square, qualifier 

sq, Square, street 
sq, squadron, military 
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wher "qualifier," "street," and "military," are the names f three distinct 
abbreviation expansion procedures 148 that are called "rules" for 
convenience. The "qualifier" rule is that the abbreviation value should be 
used if the abbreviation is preceded by a number or precedes a word whose 
5 QUAL value is "measure". The "street* rule is that the abbreviation value 
should be used if the abbreviation is preceded by a word having an initial 
upper case (i.e., if IUC is equal to "yes" for the preceding word). The 
"military* rule is that the abbreviation value should be used if the 
abbreviation Is preceded or followed (within 2 words) by a word or words 
10 such as Air Force, fighter, flight, bomber and so on. 

Another example of multiple entries in table 148 for a single abbreviation 
value is: 

15 mo, Missouri, state 

mo, Monday, weekday 
mo, month, measure 

where the "state" rule indicates that "mo" should be expanded to "Missouri" if 
20 (a) immediately followed by a 5-digit or 9-dig'rt integer, (b) immediately 

preceded by a city name in Missouri, (c) abbreviated by period and followed 
immediately by a lower case word (i.e., for which ILC is equal to "yes"), or 
(d) known to be part of an address. 

25 The "date" rule indicates that "mo" should be expanded to "Monday" If (a) 
adjacent to another weekday, (b) separated from another weekday or month 
word (I.e., a word whose QUAL value is "weekday" or "month") by a 
day-type integer (i.e., a word whose QUAL value is "day"), or (c) is adjacent 
to a time expression (i.e., a word whose QUAL value Is "time"). 

30 

The "measure" rule indicates that the abbreviation value ("mo" in this 
example) should be expanded to the expansion value ("month" in this 
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example) if (a) immediately preceded by a number or a number follow d by 
a m asure qualifier, r (b) preceded or foil wed by a word whose QUAL 
value is "day" or "year". 

5 if a won! is found in the abbreviations translation table and evaluation of the 
associated rule results in a determination that the abbreviation value should 
be used, then the resulting expansion value is passed to the TTS converter 
(212). 

10 Using the City-State Table 

Part (b) of the above mentioned "state" rule requires that the text expander 
procedure 140 have access to information regarding which cities are located 
in which states and provinces. City and state names are distinguished from 
15 other abbreviations, in part, through use of the city-state table 147 and in 
part through use of the abbreviations translation table 146. The city-state 
table 147 is a list of city-state pairs. That is, every record in the table 147 
has two fields: 

20 state name field, city name field 

where the state name field stores the full text name of a state or province 
and the city name field stores the full or abbreviated text of a city name in 
the state or province identified in the state name field. 

25 

As previously explained, when a word that is potentially a state name or 
province (or an abbreviation of a state name or province) is detected by the 
classifier procedure 136, the corresponding record in the STPROV array is 
set to a value of "Yes" and, if the word being analyzed is an abbreviation, 
30 then the full text name of the corresponding state name is stored in the 
corresponding record in the QUAL array. If a word (or pair of words) in the 
specified text represent the full text of a state or province name, then the 
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STPROV record f r that word (or words) is set to a valu of Y s, and the 
corresponding QUAL record(s) stores (store) th unexpended full text of the 
state or province name. 

5 When a word that is a city name (or potentially an abbreviation of a city 
name) is detected by the classifier procedure 136, the keyword "city" is 
stored by the classifier procedure in the corresponding record in the QUAL 
array. 

10 To further explain the use of the city-state table, we will consider how the 
enhanced text to speech reader 128 handles the three-word sequence 
"8 F, CA". A portion of the city-state table 147 includes the following 
records: 

California, san bemardino 
California, san diego 
California, san francisco 
California, san jose 
California, san mateo 

and two portions of the abbreviation table contain the following records: 

ca, California, statel 
ca, California, state3 
ca, circa, misc.abbr 
cal, California, state2 
catg, Calgary, city 
calif, California, State3, 
camb, Cambridge, city 



15 



20 



25 



30 



8 bem, San Bemardino, city 
8 c, South Carolina, Statel * 
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6 d, San Diego, city 
6 d, South Dakota, state 1 
8 d, South Dakota, state3 
8 dak, South Dakota, state3 
5 s diego, San Diego, city 

8 f, San Francisco, cKy 
8 f 9 Santa Fe, city 
8 f, Sioux Falls, city 
s j, San Jose, city 
10 s I, Salt Lake City, city 

8 mateo, San Mateo, city 
8 sgt, Staff Sergeant, title3 
8, South, direction 
8a, Saturday, weekday 

15 

sd, South Dakota, statel 



During the first pass through the text, which is performed by the text 
20 classifier procedure 136, the classifier marks the words S F as AUC (all 
upper case) and QUALscity, and marks the word "CA" as AUC, 
STPROVsYes, and QUAU'Califomia". The two words S F are identified as 
a city by the classifier by searching the abbreviation translation table 146. 
The first matching abbreviation translation record found by the classifier has 
25 a rule type of "city 11 , and that rule type is stored in the QUAL record for both 
words (S F). 

The second pass through the text, performed by the text expander 
procedure 140, finds the first record in the abbreviation table matching the 
30 words S F: 

8 f, San Francisco, city. 
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and therefore it applies the city abbreviation expansion oil . The city rule 
n t s that th words S F are all upper case (as indicated by their AUC 
values) and that these words are followed by an all upper case state name 
(AUC and STPROV for the next word, CA, are both equal to Yes). When 
5 this pattern of words is detected, the city rule searches the city-state table 
for a record that matches dtysSan Francisco, statesCalifomia. More 
generally, the city abbreviation expansion procedure searches for a match of 
dtyxexpansion value, state=QUAL value. 

10 Since a match for city»San Francisco, statesCalffomia is found in the 

city-state table, the expansion value "San Francisco" is accepted for the text 
message words S F. 

Next, the text expander procedure 140 processes the word CA. In the 
15 abbreviation translation table the first matching record found by the text 
expander is 

ca, California, statel 

20 and thus it executes the statel abbreviation expansion rule 148 to determine 
if the expansion text •California' should be accepted. The statel rule (i.e., 
procedure) computes several conditions, one of which is a check that the 
preceding word has either IUC or AUC equal to Yes and QUAUcity, thereby 
indicating that the preceding word(s) is(are) a city name. Since the 

25 preceding word(s) represent the city name San Francisco, the statel rule 
looks for a match in the city-state table for state-California, city=San 
Francisco. When it finds a matching record in the city-state table, the 
expansion text "California" is accepted as the expansion of the word "CA". 

30 If the text message had been 'San Francisco, CA", the QUAL value for the 
words "San Francisco" would have been "city" and IUC (Initial upper case) 
would hav been qualtoYes. When the words preceding a potential state 
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name have IUC equal to Yes, the city-state table is searched by the state 1 
rule for a record matching city=preceding words, state=pot ntial stat name. 
Since a matching record would be found in the city-state table for ctty=San 
Francisco, state*Califomia, the expansion value "California" would be 
5 accepted by the statel rule. 

If the word following S F had been SD (for South Dakota), then the text 
expander procedure 140 would not have found a match for state*South 
Dakota, cftysSan Francisco, and would therefore have rejected the 

10 abbreviation expansion "San Francisco" found in the first "s f record in the 
expansion translation table 146. Since no match would have been found, 
the text expander procedure 140 would then try the next potential expansion 
for the "S F" abbreviation as represented by the next "s f record in the 
abbreviation translation table 146. This process continues until either an 

15 abbreviation translation record is found for which the associated rule is 
satisfied, in which case the expansion text in the abbreviation translation 
record is accepted, or the original text is not expanded in accordance with 
any of the records in the abbreviation translation table 146. 

20 

Acronym/lnitialism Expander 

Next, any word not sent to the TTS converter by the Number Expander 
procedure 142, or due to its presence in the frequently used unambiguous 

25 words table 144 or due to a successful match with a record in the 
abbreviations translation table 146, is inspected to determine if it is an 
acronym or initialism (214). If not, the word is passed to the TTS converter 
(216). Words that are potentially acronyms or inhialisms must have at least 
two letters and include at least two capital letters. Furthermore, the 

30 Acronym/lnitialism expander 150 is not used to process upper-case words 
that are immediately preceded by or followed by an upper-case word (except 
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the words I r A) as this may signify emphasis ( .g., 'He said DO NOT do 
It.'). 

As explained above, an acronym is formed from pronounceable syllables, 
5 even if it represents a "made up word,' while an inhiaHsm is not 

pronounceable except as a sequence of letters and numbers. Thus, "IBM' 
and "YMCA" are initialisms, while 'NASA' and 'UNICEP are acronyms. 
Some words, such as "MSDOS' and "PCNET" are a mix of acronym and 
initialism components. 

10 

In the following discussion, the term "strip," as in "x is stripped off the word," 
means that the stripped off letters are to be treated as a separate unit or 
morph for further processing. "Stripped" off items are not discarded. 

15 The term "spelled out" means that the letters of the spelled out hem are 
stated in sequence. Thus, spelling out "PCN" means outputting the 
sequence 'P-C-N' to the TTS converter. 

The Acronym/lnitialism expander 150 performs the following functions. If the 
20 word being processed has both upper-case and lower-case letters, case is 
used to separate morphemes (herein called morphs). For example, 
LabVIEW is separated at the junction between lower-case and upper-case 
letters into Lab and VIEW. The resulting morphs are then treated 
separately. 

25 

Next, the Acronym/lnitialism expander 150 strips off any morphs found in 
morph table 152 from the word being processed (e.g., DEC, DOS, Info, Net), 
going through the word from left to right, except If the letters In the matching 
morph are part of a word in an exceptions list (e.g., December, Decor, Dose, 
30 Inform). Each of the stripped off morphs is treated separately. 
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The following then is applied to each of the morphs. When thre of the 
same I tter are in sequence, they are replaced with th words triple x' 
where x is the letter. Sequences of three or more adjacent consonants 
(other than three of the same letter in sequence) are spelled out. One-letter 
5 and two-letter sequences are also spelled out, with certain exceptions (e.g., 
BE, BY, DO). 

Using C to represent any consonant and V to represent any vowel, each 
morph is analyzed going from left to right. In a sequence 

10 

(C1) V1 C2 V2 

where parentheses are used to mean that the item in parentheses is 
optional, the morph is split after V1. In a sequence 

15 

(C1) V1 C2 C3 V2, the morph is split after C2. 

Initial vowels in sequences like V1 C2 V2 are stripped off (e.g., A-MEN, 
E-TON). 

20 

At this point we are left with either a C V ... or C V C ... syllable. If there is 
an unusual vowel sequence at the beginning or end of a syllable, e.g., ae, 
aa, K, ui as in SAE, CAA, Nil, DUI, the syllable is spelled out. 

25 Next, the procedure usually strips off and spells out an unusual consonant 
such as C before I or E in some cases, and X except before E. Whether 
stripping occurs depends on subsequent letters, e.g., CIO sequences are 
spelled except for those followed by ER as in CIDER. 

30 If the only thing left is a C V sequence of letters, e.g., PO in APO and RA in 
ERA, the Acronynvlnitialism procedure 150 spells out the remaining letters. 
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Next, going from left to right, the AcronynVlnitlallsm procedure 150 checks 
for acceptabl initial C V and C C V sequences ( .g., Low, Foe, Toe, Moe, 
Pro, Fro, Stow, Slow are all acceptable) and strips any unacceptable 
consonants until an acceptable sequence remains. This results in outputs to 
5 the TTS converter such as V-RAM, M-PEG, P-C-NET, where the consonants 
before the hyphens are consonants that have been stripped off. 

Whatever remains is then processed from right to left, checking for 
acceptable final C V and C C V sequences (e.g., Rope, Rose, Bone, Bore, 
10 Calm, Mild, Mint, Elf, Tenth, are acceptable). Unacceptable consonants are 
stripped from the end of the morph until an acceptable sequence remains. 
A final S (which may indicate a plural word) is not considered when 
performing this right to left processing step. This right to left check results in 
outputs to the TTS converter such as NET-X, FAS-B and HUM-V. 

15 

If the only remaining letters are two letters that fit the pattern CV, those 
letters are spelled out. Otherwise the remaining text is left unchanged. 

Finally, the resulting units or morphs are strung together with connecting 
20 hyphens, and the resulting string is passed to the TTS converter. In 

summary, the acronym/initiaJism expanding procedure 150 identifies words in 
the text message that are acronyms and initialisms, parses pronounceable 
syllables within the identified words and generates a substitute string 
comprising a sequence of units. Each unit In the generated sequence 
25 consists of a letter, number, pronounceable syllable or a multiple letter 
identifier. 

The TTS converter processes all strings passed to It In accordance with well 
known text to speech conversion procedures, thereby generating a verbal 
30 representation of the text message. In particular, the control procedure 132 
passes to the text to speech converter 130 for each word In the text 
message (A) the xpanslon value, If any, generated by the abbreviation 
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xpander procedure 140, (B) the substitute string, If any, generated by th 
acronynVinitialism expanding procedure 150, and oth rwis (C) the word. 

While the present invention has been described with reference to a few 
5 specific embodiments, the description is illustrative of the invention and is 
not to be construed as limiting the invention. Various modifications may 
occur to those skilled in the art without departing from the true spirit and 
scope of the invention as defined by the appended claims. For instance, it 
would be possible to make the classifier procedure a subprocedure of the 

10 text expander procedure such that the classifier is not used to process some 
words, such as state names following city names, when the text expansion 
process for the prior words has already expanded or inherently classified 
those words, and is used more efficiently in contexts where the text 
expander already knows some likely characteristics of a word from the prior 

15 word or words. 
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TABLE 1 
Classifier Data Structures 



Data 
s stoitium 

— all — 



10 



FPER 



ILC 



15 



IUC 



20 



25 



30 



AUC 



PSNUM 



STPROV 



35 PREPUNC 



40 PUNC 



Description of Record Contents 

Each data structure contains one record for each word in the 
filtered text 

yes/no indication of word ending with period. 

Examples of words for which value is yes: abbreviation ending 

in period; last word of a sentence. 

yes/no indication of Initial Lower Case, i.e., whether the word 
starts with a lower-case letter and the rest of the letters of the 
word are lower case. 
Example: house 

yes/no indication of Initial Upper Case, i.e., whether the word 
starts with an upper-case letter and the rest of the letters are 
lower case. 

Examples: San, California, He. 

yes/no indication of Ail Upper Case, i.e., whether all letters in 
the word are upper-case. 
Example: CA, NASA 

yes/no Indication of pseudo-number. A pseudo-number is any 
word that can sometimes be used in place of a number. 
Examples: several, few, doz. 

yes/no indication of an abbreviation or full word that potentially 
represents a U.S. state name or a Canadian province name. 
Examples: Ca, CA, California, Missouri, British Columbia. 

punctuation at the beginning of e word, such as the left 
parentheses at the beginning of a telephone number's area 
code. 

Example: "(' for the word "(415)". 

punctuation at the end of a word, such as a period at the end 
of an abbreviation. 

Examples: V for the word "in.'; for the word "e.g.,". 
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TABLE 1 f continued 

separator between the current word and the next, whether or 
not there are surrounding spaces. 

Examples: V for the words (344), •-■ for 1-16-95, and T for 
1/16/95 and 3/4. 

yea/no indication of a number. In a number each character is 
an Integer, period, or comma, possibly followed by a modifier 
like a plus sign (e.g., '20+ rules") or the letter K (e.g., *256K 
memory - ). 

a character string that describes the potential type of a word. 
QUAL Value 

time 

ordinal 

day 

oldyr 



year 
direction 



Description and Examples 
any time value, e.g., 3:45 
ordinal number, e.g., 21st, 3rd, fifth 
any number in the range 1-31 
any 2-digit number greater than 31 or any 
three digit number, e.g., 35, 250 
any 4-digit number less than 2500 
any compass direction indication, such as 
E, W, N, S, NE, NNW 
measure any unit of measurement such as ft., yd. 

spelled number any spelled out number, such as eleven 
measure modifier measurement modifier or qualifier, such as 
cubic, running, across, odd 
comparison words, such as older, colder 
item indication words, such as this, multiple 
any full word or abbreviated month 
any full word or abbreviated day of the 
week 

reference to something within a range of 
numeric values, such as grade, chapter 
beat, loss 

indication of a potential city name 
a value such as California, or Missouri is 
stored in the QUAL array for potential state 
and province names 

indicates an abbreviation that doesnl fall 
into other categories 
morning/evening indicator a.m., PM 
era indicator: BC, b.c, AD, a.d. 
any string that might be a roman number, 
such as XII, IV, xii 

a person's title, such as Mrs., Gov., Maj: 
street type indicator, such as road, rd., ave. 



comparison 
which 
month 
weekday 

range 

sports term 
city 

full state name 



misc_abbr. 

ampm 

bead 

Roman 

title 
street 
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TABLE2 

Frequently Used Unambiguous Words 

the 
5 of 

and 

you 

that 

was 
10 with 

his 

they 

this 

have 
15 by 

one 

had 

not 

but 
20 what 

were 

when 

there 

your 
25 which 

their 
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WHAT IS CLAIMED IS: 

1 . A computer system for converting a text message into audio signals, 
including: 

5 a text to speech converter that translates text into audio signals; 

a text reader control procedure for receiving said text message, said 
text message having a sequence of words; 

a classifier procedure, coupled to said text reader control procedure, 
that classifies specified words in said text message with regard to syntax, 
10 punctuation and membership in predefined classes of words, said predefined 
classes of words including numbers, measurement units, geographic 
designations, and date/time values; said classifier procedure generating 
classification values for said words in said text message; and 

an abbreviation expanding procedure that receives said classification 
15 values and said text message, identifies abbreviation words in said text 
message that match predefined abbreviations and identifying a set of 
potential expansion values for each said identified abbreviation word, 
evaluates based on said classification values for words neighboring said 
identified words which, if any, of said potential expansion values is 
20 applicable, and substituting said potential expansion value for said identified 
abbreviation word when said evaluation yields a success value; 

wherein said text reader control procedure includes instructions for 
receiving expansion values from said abbreviation expanding procedure, and 
passing to said text to speech converter for each word in said text message 
25 said expansion value, if any, generated by said abbreviation expanding 
procedure for said word. 

2. A computer system for converting a text message into audio signals, 
including: 

30 a text to speech converter that translates text into audio signals; 

a text reader control procedure for receiving said text message, said 
text message having a sequence of words; 
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an acronym/initialism expanding procedure, coupled to said text 
reader control procedure, that identifies words In said t xt message that are 
acronyms and inftiallsms, parsing pronounceable syllables within said 
Identified words and generating a substitute string comprising a sequence of 
5 units, each unit selected from the set consisting essentially of a letter, 
number, pronounceable syllable and multiple letter identifier, said 
acronym/initialism expanding procedure Including instructions for separating 
pronounceable syllables and individual letters not pronounceable as syllables 
from an identified word in said text message in accordance with predefined 

10 word parsing criteria; 

wherein said text reader control procedure includes instructions for 
receiving said substitute strings from said acronym/initialism expanding 
procedure, and passing to said text to speech converter for each word in 
said text message said substitute string, if any, generated by said 

15 acronym/initialism expanding procedure. 

3. A computer system for converting a text message into audio signals, 
including: 

a text to speech converter that translates text into audio signals; 
20 a text reader control procedure for receiving said text message, said 

text message having a sequence of words; 

a classifier procedure, coupled to said text reader control procedure, 
that classifies specified words in said text message with regard to syntax, 
punctuation and membership in predefined classes of words, said predefined 
25 classes of words including numbers, measurement units, geographic 
designations, and date/time values; said classifier procedure generating 
classification values for said words in said text message; 

an abbreviation expanding procedure that receives said classification 
values and said text message, identifies abbreviation words in said text 
30 message that match predefined abbreviations and identifying a set of 
potential expansion values for each said identified abbreviation word, 
evaluates based on said classification values for words neighboring said 
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identified words which, If any, of said potential expansion values is 
applicable, and substituting said potential xpanslon value for said identified 
abbreviation word when said evaluation yields a success value; and 

an acronym/inHiallsm expanding procedure, coupled to said classifier 

5 procedure, that identifies words in said text message that are acronyms and 
inltialisms, parsing pronounceable syllables within said identified words and 
generating a substitute string comprising a sequence of units, each unit 
selected from the set consisting essentially of a letter, number, 
pronounceable syllable and multiple letter identifier; 

10 wherein said text reader control procedure includes instructions for 

receiving expansion values from said abbreviation expanding procedure and 
said substitute strings from said acronym/initial ism expanding procedure, and 
passing to said text to speech converter for each word in said text message 
(A) said expansion value, if any, generated by said abbreviation expanding 

15 procedure for said word, (B) said substitute string, if any, generated by said 
acronym/initialism expanding procedure, and (C) said word otherwise. 
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I 
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04 
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206 
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